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Abstract 

In this paper, a many-sources large deviations principle (LDP) for the transient workload of a multi- 
queue single-server system is established where the service rates are chosen from a compact, convex 
and coordinate-convex rate region and where the service discipline is the max-weight policy. Under the 
assumption that the arrival processes satisfy a many-sources LDP, this is accomplished by employing 
Garcia's extended contraction principle that is applicable to quasi-continuous mappings. 

For the simplex rate-region, an LDP for the stationary workload is also established under the 
additional requirements that the scheduling policy be work-conserving and that the arrival processes 
satisfy certain mixing conditions. 

The LDP results can be used to calculate asymptotic buffer overflow probabilities accounting for 
the multiplexing gain, when the arrival process is an average of i.i.d. processes. The rate function for 
the stationary workload is expressed in term of the rate functions of the finite-horizon workloads when 
the arrival processes have i.i.d. increments. 
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I. Introduction 

The drive to achieve maximum efficiency in wireless data networks and high-speed switches 
has lead to many advances in the design of good scheduling policies. One such family of good 
scheduling policies is an online policjo called the maximum weight (max-weight) scheduling 
policy. For the typical multi-class queue where only one queue can be served at a time, the 
max-weight policy serves one of the queues that has the largest value for the product of the 
workload and the service rate. We are interested in applying the max-weight policy to wireless 
networks where the scheduler is able to change the operating parameters at different levels of 
the traditional networking stack. Thus, the server has access to a richer choice of service options 
when compared to a traditional multi-class queue setting: each service option is a point within 
a compact, convex and coordinate-convex rate region. In this setting, the max-weight policy 
naturally generalizes to finding an operating point within the rate region that has the maximum 
projection along the workload vectofl Note that in this setting the traditional single-server multi- 
class queue has a rate-region given by a simplex. 

In the present article, with K independent queues we seek to derive the probability of buffer 
overflow, when the server scheduling follows a max-weight policy. More specifically, for a 
given finite value i? > 0, we consider the two buffer overflow quantities. First, we consider 
P(Wo,T > BIk) where Wo,t E M;^ is the transient workload (to be formally defined later) 
at time with "zero" initial workload at time — T and 1^ E is the vector of all Is. The 
second quantity we study is the stationary overflow probability for the limiting workload vector 
as T ^ oo, i.e., P(W > BIk)- Since these probabilities are, in general, very hard to compute 
exactly, we consider logarithmic asymptotics to the probabilities of interest using the theory of 
large deviations. In particular, this paper, under a "many-sources" scaling regime, establishes 
logarithmic asymptotics for 1) the transient workload for a compact, convex, and coordinate- 
convex rate region; and 2) the stationary workload for a simplex rate region using a work 
conserving scheduler. 

'Online policies are those than can only use the past history of the arrivals, workloads and service decisions to decide on the 
scheduling choice; for example, these policies are not even aware of average arrival rates. 

^In many ways cross-layer optimization has resulted in some firm strides towards a union of information theory and 
communication networks of the sort that was sought in [11]. 
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In the classical^ many-sources asymptotic, one considers a sequence of queueing systems 
indexed by the number of (independent) sources muhiplexed (or averaged) over a particular 
queue, i.e., the arrival process to each queue is the average of L processes. The analysis focuses on 
the asymptotic behavior of the systems when L — oo. In our work, we consider a generalization 
of many sources asymptotic in which the input to the queueing system L exhibits a sample 
path large deviations property (LDP) similar to that of the average of L independent arrival 
streams (See Assumption 1). Given a sample path large deviations principle (see Definition [3]) 
for the arrival processes, we derive a large deviations principle for the workload under the 
max-weight scheduling polic. In particular, we first show that the finite-horizon workload is a 
quasi-continuous map of the arrival process, for both the regular version of the max-weight policy 
and for a work-conserving version of it. Then the first contribution of the paper is that the finite- 
horizon workloads satisfy and LDP. This is obtained using a recent extension of the contraction 
principle by J. Garcia [41]. Restricting our attention to the simplex rate region (corresponding 
to the traditional multi-class single server queue), we again use Garcia's extended contraction 
principle (along with a mixing condition assumption on the arrival process) to establish an 
LDP for the stationary workload. We should emphasize here that in contrast to related "many- 
sources" LDP results on FCFS and Priority policies that can be shown to be continuous, our LDP 
is established for an inherently discontinuous map that results from the max-weight scheduling 
policy. The LDP results (Theorems [H [2l and [3]) directly imply that the probability of buffer 
overflow has an exponential tail whose decay rate is dictated by a good rate function determined 
by the statistics of the arrival process. This rate function can be expressed as a solution to a 
finite-dimensional optimization problem which has the same flavor of a deterministic optimal 
control problem. The final contribution of our work is to provide a simplified form for the 
corresponding rate functions, when the arrival process has i.i.d. increments. 

The outline of the paper is as follows. In Section HIl we briefly motivate and contextualize 
our work in the larger body of literature on LDP analysis of queues as well as cross layer 
scheduling. The problem formulation is given in Section [nil Section |IV] provides background 
and preliminary results. The main results of the paper, which are the LDPs of the workloads, 

'The appellation "classical" is taken from [53] where a general scaling framework is presented that encompasses in a single 
setting all the different scalings used in the "many-sources" scaling regime. 
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are given in Section |V] and proved in Section |Vll Section IVIII gives simplified expressions of 
the rate functions. We conclude in Section IVIIII with a discussion of future work. 

We close this section with a summary of various notation used in this work. We use bold 
letters to discriminate vectors from scalar quantities as well as their components. We denote the 
set of natural and non-negative real numbers by N and M+, respectively. We take ® to represent 
the Kronecker product. For < mi < m2 integers and a vector sequence(Af, t G N) where 
At E M:^ for K E N, we define A(mi,m2] := Xlllmi+i ^* ^^e cumulative arrivals from 
mi + 1 until timeslot m2 where addition applies coordinate- wise. For vector-valued sequence A 
we write A|(mi,m2] to denote the finite subsequence {A-.^^' • • • ! -^-nn-i}- For a vector x E IR+ 
and set B C M^, Projg(x) denotes the projection of vector x on the set B, int(i?) := {X E B : 
3X' E B s.t. A < A'} denotes the set of points strictly inside B, and [x]+ := max{0,x} where 
the function applies coordinate-wise. Lastly, for any given function F : X t-^ y on metric spaces 
X and y, and x E X, we use the notation 

""F := {y Ey : {3xn ^ x) such that F(x„) y}, 

to denote the set of all cluster points (in y) of the images of sequences in X converging to a 
point X E X. Note that ^ -"F, in general, is a correspondence (also called a set-valued function) 
from X to V{y) (the power set of y). However, *^''*F is single-valued at x, i.e., ^F = {F(x)}, 
if and only if F is continuous at x. 

II. Related Work 

In recent years, cross-layer scheduling has become a major focus of research in queueing and 
information theory due to its potential applications in communication networks. For brevity we 
do not list many important works, and instead mention only those closely related to this work. 

In our LDP analysis, we follow the lead of many recent papers on the analysis of scheduling 
algorithms [9], [10], [12]-[16], [18] by considering logarithmic asymptotics to the probabilities 
of certain rare events. Maximum weight scheduling policy falls under class of the generalized 
CyU. -rule policies and is known to be stabilizing under very mild conditions [l]-[5]. A refined 
analysis of this policy shows that it minimizes the workload in the heavy traffic regime [6]-[8] 
over a large class of stationary online policies. This optimality of the max-weight policies also 
carries over to Large Deviations based tail asymptotes: the work-conserving version of these 
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policies is known to minimize the exponent of the tail asymptote of the stationary workload 
over a large class of stationary, online and work-conserving policies [10]. 

The present paper is closely related to [14], [15], where the buffer overflow probability for the 
workload processes of a single-server multi-queue queueing system under max-weight policies 
and general compact, convex and coordinate-convex capacity regions was established. While 
[14], [15] addresses the "large-buffer" scaling regime, this paper establishes similar logarithmic 
asymptotics results under the "many- sources" scaling regime (see [16]-[26], [30]-[39]). As the 
body of work on the "many- sources" scaling regime has grown, results have been established for 
many different scheduling policies for a single-server queue and also for networks of queues, 
namely, FCFS [20]-[22], [24], [26]-[28], [30]-[33], [38], [39], priority queueing [18], [24], 
[26], [36], [38], GPS [37], and SRPT and similar policies [16], [17], [34]. Our LDP analysis of 
max-weight scheduling is strongly motivated and complemented by these papers. 

Finally, we close this section with a discussion of our motivation to consider the "many- 
sources" scaling studied in this apper. The interest in the "many-sources" asymptotic is known to 
be best motivated by 1) a recent and practical interest in applications when there are large number 
of flows to each user or node. This asymptote usually gives a more refined approximation to the 
probabilistic quantities of interest by incorporating the impact of the multiplexing gain [16]-[39] 
obtained by averaging many traffic sources together. However, our interest in the "many-sources" 
asymptotic has also been fueled by our earlier work [40] on 2) a cross-layer optimization of 
the PHY layer parameters, e.g., duration of the finite code blocks or cooperative cluster size 
when the fading channel is operated at high signal-to-noise-ratio (SNR). The high SNR regime 
is a very natural setting for the many-sources scaling since the capacity of the channel typically 
scales to infinity as log(SNR), and therefore it is natural to scale the arrival rate of the flows with 
the same parameter, which is best accomplished by multiplexing more sources; in other words 
by setting L oc log(SNR). The present work on the many sources large deviation analysis of 
max-weight provides a first step in extending the above cross-layer optimization to a multi-user 
setting, an important topic for future research. 

in. Problem Formulation 

We consider a discrete-time queueing system with K E N independent queues and one server. 
We are interested in the statistical properties of the unfinished workload in queue k at time t 
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under a max-weight server allocation policy. Let Wf^ G M+ be the unfinished workload (queue 
length) of queue k at the beginning of time —t and be the amount of service allocated to 
queue k during time (— t, —t+1]. Let := {W^, k E IC) he the corresponding workload vector 
and Ht := /c G /C) be the rate vector. For every queue k E JC := {1, . . . , K} we assume 
that work (in bits) arrives into the queue given by a sequence (v4j,t G N) where A'l E IR+ is 
the work brought in at time —t. For t E N, the dynamics of the workloads of queue k E JC is 



Note that we assume that the arrivals At happen any time in (— t, —t + 1) but cannot be served 
in that timeslot —t. 

The set of server's operating points is restricted to a compact and convex set TZ C known 
as the capacity or rate region of the server, i.e., Rt G TZ, for all time t. We make the simplifying 
assumption that bits are infinitely divisible so that the rate allocations can be assumed to be real 
numbers. Furthermore, we also assume that TZ is coordinate-convex, i.e, if /xi G TZ, then every 
A*2 £ such that < /xi is also in TZ where the inequalities apply along each coordinate. We 
are interested in the max-weight scheduler, and its closely related work-conserving version. At 
the beginning of timeslot —t, the rate vector Rt G 7^ is selected by a max-weight scheduler in 
response to the current workload W^. Specifically, under max-weight scheduler and in response 
to the current workload Wf, the rate vector R^ is chosen such that 



As later established by Lemma [H it is possible to construct a quasi-continuous (see Defn \T\ 
in Section ITVl) function H such that = H(Wt). We call this construction the (max-weight) 
scheduling function. We also define a non-idling modification of max-weight scheduler for which 
the rate of service R^* is such that it splits the service when the unfinished workload in each 
queue k is less than C'' = maxji?'^ : R G TZ}. Lemma [U also shows that it is possible to 
construct a quasi-continuous function H""^ such that R" = H^'^(Wt) and 



When necessary we will distinguish the workload vectors that result from the work-conserving 
max-weight by labeling them as W'^'^. 



Wt, = [W,'-Rl]^ + Al. 



(1) 



HI E arg max < R, > . 



(2) 




(3) 
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As mentioned earlier, in this paper we are interested in the probability distributions for the 
finite -horizon and infinite-horizon workloads. The finite-horizon workload, denoted by Wq^Tj is 
the workload at time 0, assuming the initial condition at time — T is W^^ = . The index T 
in Wq^t reminds us of this initial conditiono The infinite-horizon workload, W, is defined as 
W := lim^^oo Wq t- We assume that the limit exists but may be infinite. Note that our results 
for the infinite-horizon workload are obtained in the restricted setting of a work conserving 
max-weight scheduler operating on a simplex rate region. For this work-conserving max-weight 
scheduler, it is known that W"^'^ is the stationary workload when the system is stable. 

We will use functions Gt and G^'^ to relate the arrival process A|(o,r] with the unfinished 
work under max-weight scheduling, Wo,t and under work conserving max-weight Wq^, i.e. 
Wo,T = Gt(A|(o,t]) and W^^^ = G't'(A|{o,t])- Similarly, we also define function to 
describe the workload under work-conserving max-weight when the arrival sequence is given, 
i.e. W"'^ = G"^'^(A); we do not indicate the rate-region here as it is implicitly understood to be 
the simplex rate-region. 

For each user k E JC and system indexed by L G N, we will assume a stationary arrival 
process of work brought into the system given by a sequence A^''" := {A^'^,t G N) where 
Aj'^ G IR+ is the work (in bits) brought in at time —t into the queue of user k. The arrivals to 
different queues/users are assumed to be mutually independent. Also let A^ := (A'^'^, k E JC) he 
the sequence of arrival vectors. In our large deviation analysis, we characterize the asymptotic 
probability distributions for the finite-horizon and infinite-horizon workloads, G'j'(A^|(o,t]) and 
G?;"(A^|(o,T]), as L ^ cx). 

We close this section by noting that in its typical avatar a max-weight scheduler is also 
accompanied by a set of non-zero weights /3 E M;^ to weigh the workload vectors while 
determining the max-weight service vector. Using the observations in [14], [15] it can be seen 
that there is no loss of generality in assuming that the weight vector is Ik- 

''Note that the result remains valid even when the initial condition is within TZ with the work-conserving scheduler. With 
Wr £ TZ, we always have the workload at time — T + 1 be Wt-i = [Wt — //""^(Wt)]"*" + At ~ At from the non-idling 
condition that we imposed on the server allocation mechanism as Proj^(WT) = Wt- 



Febraaiy 26, 2009 



DRAFT 



8 

IV. Background and Preliminaries 

In this section, we provide a brief review of fundamental definitions, concepts, and relevant 
results in large deviations theory that are essential for understanding our paper. Except for 
Lemma [H which establishes the quasi-continuity of the scheduling functions H and H^'^, the 
material in this section can be found in [24], [26], [41], [51]. 



A. Quasi-continuity and Almost Compactness 

In this section, we recall two important analytic properties for functions on metric spaces: 
quasi-continuity and almost compactness. These properties allow for an extension of contraction 
principle to which we later appeal. First, let us provide the definition of quasi-continuity on 
metric spaces: 

Definition 1: [41, Theorem 3.2] Let X,y be complete metric spaces. Function F : X t-^ y 
is quasi-continuous at x G A" if and only if for each x E X, there is a sequence such that 
Xn X, F(xn) — ^ F{x), and such that for all n, F is continuous at a;„. 

Remark 1: Obviously, every continuous function is quasi-continuous. A step function F : 
M R, where F(x) = for a: < 0, F{x) = 1 for x > 0, is quasi-continuous. However, if 
F(0) = 1/2, then F is not quasi-continuous. 

Remark 2: An important property that we will user later is that if F is a continuous function 
and G is a quasi-continuous function, then F o G is quasi-continuous. However, G o F is not 
necessarily quasi-continuous [41]. 

As stated before, max-weight and its work conserving variation allow for quasi-continuous 
function selections. 

Lemma 1: There exist quasi-continuous functions H and H^'^ such that: 

H(Wt) e argmax < R, Wt >, 

and 

I H(Wt) otherwise 

Proof: See Appendix |Al The proof relies on the structural properties of the scheduling 
maps. ■ 
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Having provided the definition of quasi-continuity, we are ready to define almost compactness 
for a function: 

Definition 2: [41, Lemma 6.1] If X,y are complete metric spaces, a function F : X ^ y 
is almost compact at x G A" if for every sequence Xn converging to x, there is a subsequence 
along which F converges to a point y ^ y . We say that F is almost compact if it is almost 
compact at every x ^ X. 

B. Topology for Sample Paths 

Since a large deviations principle is defined using topological entities and since we will deal 
with continuity and convergence of the workload mappings, we need to precisely specify the 
topology for the space of jthe arrival sample paths. We use the scaled-uniform norm/weighted 

used in [24] for our analysis. 



ogyE 



supremum norm topology 

Let P C {x : N (-^ M+} denote the space of non-negative sequences such that supjgp 



x(0,f] 
t 



< 



+00, and let be the K cartesian product of V. Let 1 1 ■ 1 1„ be the scaled uniform norm on V, 



I.e., ||x||,j :— supjgjsj 



x{0,t] 

t 



for all X E V, while for all a = {a'' , k E JC) E , where E V, 
the scaled uniform norm of a is \\a\\u := max^g^c I ]„. The space V is metrizable via the 
scaled uniform norm || ■ i.e., for all x,y E V, the distance between them is ||x — y\\u = 
sup^gj!^ . Define a subspace of V which contains all the arrival paths whose average 

arrival rate is equal to the expected rate /i, i.e., := i^x E V : limt_^oo ^^^^7^ = Also for a 
vector fj, = (yUi, . . . , fi^) define Vj^ to be the product space of V^k for all k E tC. We equip 
and with the appropriate subspace and product topologies [42]. For finite dimensional metric 
spaces like M" , n G N, we use the square uniform topology (sames as the product topology) 
with the square metric p [42], where p(x, y) := maxjgji „} |x' — From [24], [26] it is 
also clear that the scaled uniform norm topology is stronger than the point-wise convergence 
topology, hence the projection and shift operators are continuous under the scaled uniform norm 
topology. 

^In the theory of weak convergence of probabihty measures this topology was first proposed in [43], [45] for continuous 
functions vanishing at infinity with a continuous index set. In the same context it was then generalized to cad-lag functions with 
a continuous index set in [44]. The most general setting of this topology for discrete-time processes can be found in [46], [47], 
and the corresponding setting for continuous-time processes can be found in [48]. The usage of this topology in the context of 
Large Deviations can be found in [49] and [50]. Finally, the central theme in [24], [26], [50] is to demonstrate how this is a 
natural topology to use in the queueing context. 
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C. Large Deviations Principle 

The following definition of a large deviations principle is taken from [24]. For a complete 
reference to the theory, definitions, and tools, see [51]. 

Definition 3 (Large deviations principle): A sequence of random variables in a Hausdorff 
space X with cr-algebra B is said to satisfy a large deviations principle (LDpJ^ with good rate 
function / if, for any B E B, 

- inf J(x) < liminf ^logP(X^ e B) < limsup ^ logP(X^ e 5) < - inf J(x), (4) 

where B° and B are the interior and the closure of B, respectively, and if the rate function 
/ : A" I— i> ]R+ U {oo} has compact level sets, where the level sets are defined as {x : I{x) < a}, 
for a G M. 

If is a mapping from N to M describing the sample path of a random sequence, the LDP 
is referred to as a sample path LDP. 

D. Garcia's Extended Contraction Principle 

The contraction principle (see [51, p. 126]) says that if we have an LDP for a sequence of 
random variables, we can effortlessly obtain LDP's for a whole other class of random sequences 
that are obtained via continuous transformations. However, due to the inherent discontinuity in 
the max-weight scheduling function, this (regular) contraction principle fails to provide sufficient 
structure in the setting of our interest. Instead, we will utilize the following powerful extension of 
the contraction principle for quasi-continuous transformations on metric spaces, given by Garcia 
[41]. Garcia's extended contraction principle [41, Theorem 1.1] then says the following: 

Fact 1: Assume that Vt ^ X ^ y, X ,y ax& metric spaces, and {X } satisfies an LDP in 
X with rate function If at every x with /"(x) < oo, the following hold: 

1) F is almost compact; and 

2) for all y G ^F, there exists a sequence converging to x such that F{xn) —* y, F is 
continuous at x„, and /"(x^) — /"(x), 

then {F(X^)} satisfies an LDP with rate function given by 

I{y) = m{{P{x):ye ^F} . (5) 

^Often B is taken to be the Borel a-algebra, and a rate function is, by definition, non-negative and lower semicontinuous. 
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Remark 3: Whenever /^(■) is continuous, then the second condition is reduced to confirming 
that F is a quasi-continuous function. 

V. Assumptions and Overview of the Results 

Garcia' s extended contraction principle together with Lemma \T\ suggests the following road 
map to obtaining an LDP for the finite and infinite horizon workload processes under max- 
weight scheduling. The large deviation property for the sequences of finite- and infinite-horizon 
workloads would follow as a direct consequence of the sample-path LDP of the arrival process, 
as soon as one establishes the quasi-continuity and almost compactness of the mappings Gt, GJ^, 
and G"'^ along with some continuity properties of the rate function obtained from the sample-path 
LDP assumption on the arrival processes. As a result, our first task is to restrict our attention to 
arrival streams that satisfy the sample-path LDP as stated by Assumptions [T]l3] in Section IV-AI 
There we also discuss a family of arrival processes which satisfies these assumptions. 

A. Sample Path LDP of Arrival Processes 

The following sample path LDP for the sequence of arrival processes is the starting point 
of our analysis. 

Assumption 1 (Many-sources sample path LDP): The sequence {A^} satisfies a sample path 
LDP in equipped with the scaled uniform topology with rate function P, where the rate 
function P is given as 

/«(a) := sup J«t(a|(o,i]) = lim /»t(a|(o,t]) (6) 

for a G Vj^, where for every t G N we also assume that {A^|(o,t]} satisfies an LDP with rate 
function (in the product topology). 

Remark 4: The most general conditions for Assumption[T]to be satisfied are given in [24, Thm. 
3] (also stated in [26, Thm. 7.1, pg. 156]). There it is also shown how several standard stationary 
processes used for traffic modeling, such as i.i.d. increment processes, Markov-modulated, a 
general class of Gaussian, and fractional Brownian processes (for long-range dependent or heavy- 
tailed traffic), satisfy Assumption [T] The conditions of [24, Thm. 3] also imply that the sequence 
{A^} also satisfies an LDP on equipped with the scaled uniform topology, with rate function 
P where P{a) = cx) for a G /V^ [24], [26]. Finally, it is shown in [26, Lemma 7.8] that 
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under the conditions of [24, Thm. 3], for all t G N we have iKifJ- ® = and also that 
is convex. 

In this paper, we further assume the following continuity conditions on the rate functions: 

Assumption 2: We assume that Pt{-) is continuous in the product topology on 3?^^*. 

Assumption 3: For every point x in the effective domain of i.e., {a G Vj^ : < +oo}, 
we assume that for every sequence {a"} converging to a in Vj^ such that there exists t G N so 
that a" = a^ for all s > t (for all n), we have /''(a") — > /"(a). 

Remark 5: Assumption[3]is a manifestation of a mixing condition that provides a certain inde- 
pendence of the long-term behaviour of the process with respect to any finite initial block/window. 

In Proposition [H below we demonstrate that processes with i.i.d. increments naturally satisfy 
Assumptions [T]l3l Before proving this we define a coercive function as follows. 

Definition 4: A function / : ]R_|_ i-^ IR+ U {+00} with domain Dom(/) := {x : f{x) < +00} 
is defined to be coercive if for every y G IR+, there is compact set C C Dom(/) such that 
/(x) > y for all x G Dom(/) \ C. 

Proposition 1: For scale parameter L assume that the arrival process has i.i.d. increments 
while the arrival processes for different users are independent. Furthermore, assume that for 
every k, {A^'^} satisfies the conditions of the Gartner-Ellis Theorem [51, Thm. 2.3.6] and that 
the limiting rate function is either coercive or has domain 3?+. Then Assumptions [Il|3] hold. 

Proof: The fact the above class of processes satisfy Assumption [T] is immediate from [24, 
Thm. 3]. Similarly the Gartner-Ellis Theorem [51, Thm. 2.3.6] yields convexity and from the 
coercivity of the rate function or with its domain being IR+, Assumption [2] follows. When the 
arrival process has i.i.d. increments, then it also follows that for y G M+^* and x G , 

A(y) = EEAr'(i/3)> 

fc=l s=l 

and 

K +00 
k=l s=l 

where A* '^ is the Fenchel-Legendre transform of A^{6) := lim^^^oo ^ ^og i^^e^^^^i . Note that 
that /«(x) = +00 if X G P^^' \ V^. 

Now for every sequence {x"} converging to x in Vj^ with /"(x) < +00 such that there exists 
t G N so that x" = Xs for all s > t (for all n), we have /*(x") < +00 for all n large enough 
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and 



J»(x")-J«(x) = J«,(x"1(o,])-/»,(x|(o,]) 



^ 0. 



Remark 6: A more general characterization, especially of Assumption[3l remains an important 
area of future work. However, to provide insight about Assumptions |2ll3] and their relationship 
to Assumption [T] we provide the following simple example. Our example will be constructed 
as averages of L stationary i.i.d. random processes so it is sufficient to describe the underlying 
stochastic process: We let the random process be Xk = 'j + 'jV when k is odd and Xk = 7 — 
when k is even, where V" is a Uniform[— 1, 1] random variable and 7 > is the long-term mean 
for all our sequences. Here it can be verified that I(x) < +00 if and only if Xk = 7(1 — a) if 
is even and Xk = 7(1 + a) for k odd where a E [—1, 1]. It can also be verified that I{x) = for 
the all-7 sequence, i.e., Xk = 7. However, it is clear that one can easily construct sequences as 
required in Assumption [3] with the rate function being infinite which, nevertheless, converge to 
the all-7 sequence. In other words. Assumptions [2ll3] require more than the regularity conditions 
from [24], [26] that guarantee validity of Assumption [T] 

Given the above assumptions on the arrival processes, we are now ready to provide an overview 
of the main results of the paper. 

B. Main Results: An Overview 

Assuming that the sequence of the arrival processes {A^} satisfies a many-sources sample- 
path LDP with a "continuous" rate function (Assumptions HHSj), LDPs for the finite and infinite- 
horizon workloads will be a direct consequence of Garcia' s extended contraction principle once 
the required quasi-continuity and almost compactness properties are shown. We will demonstrate 
that the quasi-continuity and almost compactness of the finite horizon workload mappings are 
inherited from the quasi-continuity of the schedulers H and H^'^. 

Theorem 1: Under Assumptions [ill2] and for all t G N, the sequence of the finite-horizon 
workloads under a max- weight scheduler {Wq^ := Gt(A^|(o,f])} satisfies an LDP on with 
the rate function It, where for b e M^^ 



if xi. X 



inf 




(7) 
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Theorem 2: Under Assumptions [THll and for all t E N, the sequence of the finite-horizon 
workloads under the work-conserving max-weight scheduler, {Wqj := G'"'^(A^|(o,t])}, satisfies 
an LDP on M;^ with the rate function It, where for b G M;^ 

/,(b)= inf A(x). (8) 

The quasi-continuity and almost compactness of the stationary workload mapping, however, 
requires slightly more work as shown in Section |VIl In fact, unlike the finite-horizon LDP results 
of Theorems \T\ and [2l which hold for general rate regions and for both max-weight and its work 
conserving version, the infinite horizon LDP result of Theorem [3] is established only under the 
work conserving max-weight policy and only with a simplex rate region. For a vector x G 
define x := J2k=i /C^^ then the simplex rate region is given by, 

7^, := {r G M:^' : r < 1} . (9) 

Theorem 3: Consider a work conserving max-weight scheduler with a simplex rate region 
TZs. Let fj, G M^^ be an admissible arrival rate vector strictly inside this simplex rate region, i.e., 
fi G 'mt(Jls) and {A^} be a sequence of arrival processes that satisfies Assumptions [1113] in Vj^. 
The sequence of infinite-horizon workloads {W^ := G™'^(A^)} satisfies an LDP on Wf with 



good rate function J, where for b G 



J(b)= inf J«(a). (10) 

Note that all the rate functions have the appearance of being what one would naturally expect, 
i.e., among all arrival sequences that result in the required workload at the required epoch, find 
the one with the least cost to deduce the rate function. However, the discontinuity of the queueing 
map makes this a non-trivial assertion and also enlarges the set of allowed arrival sequences. 

VL Analysis: LDP's for Workloads 

In this section, we prove the main results of the paper: establishing LDP for the sequences of 
the finite-horizon and infinite-horizon workloads. We first delineate the proof for the LDPs for 
the sequence of the finite-horizon workloads. 
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A. LDP for Finite-Horizon Workloads 

In this section, for t G N, we establish an LDP for finite-horizon workloads {Wq^ := 
G7'=(A^|(o,t])} and {W|^^ := Gt(A^|(o,t])}. The approach is to first show that the mappings 
GJ^ : M;^* (— s> M;^ and Gt : IR;^* i— > M;'^ are quasi-continuous and almost compact and use 
Garcia's extended contraction principle to obtain an LDP for the finite-horizon workloads from 
the LDP assumption for {A^|(o,t]}. From FactfH the almost compactness and quasi-continuity of 
workload mappings are sufficient to establish an LDP for finite-horizon workload, since according 
to Assumption [2l is continuous (in the product topology). 

Using the quasi-continuity of the scheduling function we now prove the required quasi- 
continuity of the workload maps. First we consider the work-conserving max- weight scheduler. 

Lemma 2: For t eN, GJ^ is quasi-continuous with respect to the scaled uniform topology. 
Proof: See Appendix |B] The induction-based proof uses the quasi-continuity of the scheduler 
H and the linear dependence of the workload at time —s on A^+i for all s G (0, t — 1]. ■ 

Similarly, in order to establish an LDP for the workload process under the maximum weight 
scheduler, we have the quasi-continuiy of function Gt- 

Lemma 3: For t G N, G'j(-) is quasi-continuous with respect to the scaled uniform topology. 
Proof: See Appendix |Bl In this induction-based proof we establish certain analytical prop- 
erties of the workload map for all possible rate-regions that then allows us to obtain a quasi- 
continuous selection. ■ 

Having established quasi-continuity, the next result demonstrates the almost compactness of 
the workload maps. 

Lemma 4: For t eN, both Gt and G^'^ are almost compact on with respect to the scaled 
uniform topology. 

Proof: Since the workload at any time cannot exceed the amount work brought in from 
the last time the system was empty irrespective of the scheduler used, we automatically get the 
following bounds 



This implies the almost compactness of Gt and G^'^. Since every sequence {a"} converging 



G, 



i(a|(o,t]) <a(0,t], Gr(a|(o,t])<a(0,t]. 



to a (in P;^ 

H' 



in the scaled uniform norm topology) converges point-wise too, it follows that 
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Gt(a"|(o,t]) and G7'^(a" |(o,i]) will lie in a bounded subset of so we can use the Bolzano- 
Weierstrass theorem. ■ 
Now, as already discussed, the proof of Theorems [U and [2l is complete. 

Next, we discuss the LDP for the infinite-horizon workloads for the work-conserving max- 
weight scheduler with a simplex rate-region. 

B. LDP for Infinite-Horizon Workloads 

In this section, we establish an LDP of the sequence of the infinite-horizon workloads {W^ = 
G^''{A^)} where e Vj^, fi e int(7^s), and {A^} satisfies Assumptions [DlSl Similar to the 
last section, we first show that the mapping G""^ is quasi-continuous and almost compact on X)^ 
and then use Garcia's extended contraction principle to establish the desired LDP. 

The following lemmas establish the necessary steps to apply Garcia's contraction principle 
to the stationary workload map. The first of these lemmas relates the infinite horizon workload 
mapping to that of a finite horizon. 

Lemma 5: Consider an arrival process a G Vj^. There exists a s* = s*(a) < oo such that 
the workloads at time —s* under a falls within the rate region TZg, i.e., G'^'^(a|(5. oo)) G TZg. 
Furthermore, for any sequence of arrival processes {a" G V^} converging to a (in scaled 
uniform topology), the workloads at time —s* under a", when n is large enough, also fall within 
the rate region Tig, i.e., 3no such that G'^'^'^''(a"|(s.^oo)) ^ Tls for n > riQ. 

Proof: See Appendix O The main idea is to use the fact that a suitably normalized sum 
(over all queues) workload process behaves like the workload of a single server queue with 
only one flow and a work-conserving service discipline. This allows us to use the continuity of 
the workload mapping of a single server queue and the stability of the queue to arrive at the 
assertion of the lemma. ■ 

Now we are ready to verify the requirements of Garcia's extended contraction principle. 

Lemma 6: Let a G Vj^ be an arrival process with rate fi G int(7^s), and G"'^(a) be the 
corresponding infinite-horizon workload. For any W G '^G™'^ there exists a sequence of arrivals 
{a" G Vj^} such that a" converges to a in scaled uniform topology, G™'^(a") — > W, G™'^ is 
continous at a", and /"(a") — > 

Proof: See Appendix O The idea is to relate the infinite horizon workload, using Lemma [H 
to a finite-horizon workload mapping whose quasi-continuity was established earlier. ■ 
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Lemma 7: The mapping G'"'^' " is almost compact on T>^ with respect to the scaled uniform 
topology. 

Proof: See Appendix. The proof is along the same lines as that of Lemma |6l ■ 
Again, the above lemmas and Garcia' s extended contraction principle immediately yield the LDP 
for the sequence of the infinite-horizon workload in Theorem [3l 

Let us now consider the problem of calculating the rate function. Eqn. (flOl) suggests that the 
rate function J, where J(b) = inf^^p^^ aQwc^j^ Jf (a), could be interpreted as the minimum-cost 
solution among all paths a G such that b G '^G"'^, where the cost of the path a is /"(a) 
and convex. Hence, the problem of finding the rate functions is a deterministic optimal control 
problem like those in [12], [14]. However, the expressions for the rate functions It and J in ([7]) 
and (flOl) are of little use in their current forms, as their computation is far from straight forward. 
In the next section, we simplify the rate functions when the arrival processes are limited to 
having i.i.d. increments. 

VIL LLD. Increments: Simplified Rate Functions 

In this section, we give a calculation of the finite-horizon and infinite-horizon rate functions in 
the case when the arrivals have i.i.d. increments. In this case, the cost of a sample path a G X'^S 
which is /"(a), is additive and the total cost of any arrival sample path is the sum of the cost over 
all timeslots and queues. This property helps us to simplify the calculation of the rate functions. 

Consider the underlying arrival process A to be a process with i.i.d. increments, e.g., a 
compound Poisson arrival process with exponential packet length (see [40]). For these i.i.d. 
increment arrival processes, as mentioned earlier, we have 

K t K +00 

k=l i=l k=l i=l 

Next, we simplify these rate functions. 

A. Infinite-Horizon Rate Function 

The following lemma expresses the infinite-horizon rate function J as the infimum of the 
finite-horizon rate functions It over all time t. 
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Lemma 8: For i.i.d. increment arrival processes with ^ E 'mt(Jls), the infinite-horizon rate 
function J is simplified as 

J(b) = inf/i(b). (11) 

Proof: The cost of a sample path over time is the sum of the cost of arrivals in all timeslots. 
As in the proof of Lemma [6l for a G Vj^ where /x G 'mt(JZs), we can find t := s*(a) such 
that W((a) G TZs- Hence, for a such that b G '^G™'^, one can reduce the cost of the path by 
constructing a new sample-path a by setting a^, = ^ for all v > t and a^ = a^ for all < t 
while still satisfying b G '^G"''. This is because /"(a) = /ft(a|(o,t]) < /*(a). On the other hand, 
since Wt(a) G TZs, we can write b G '^ko.tJGJ'^ All of these imply that 

J(b) = inf J«(a)=inf inf IKM = mi Uh), 

aGl5/,^-:^G""3b *>1 x6Rf ;^Gr9b t>l ' ' 

by the definition of It{h) in ©. ■ 

With this simplification available, we now look at the finite-horizon rate function in more 
details. 



B. Finite-Horizon Rate Function 

In this subsection, we provide a further simplified expression of the finite-horizon rate function 

If 

Lemma 9: For t eN, the finite-horizon rate function It is simplified as 

/t(b) = min ( J»i(b), min inf /*„(x)) (12) 

y Mg(l,i] xeA(u,b) J 

for b G M;^, where for u > 1 

Aiu, b) := {x G M^"" : b G ''G^, G:^„(x| ^ 7^„ e - 1]}. (13) 

Proof: This follows the idea from the proof of Lemma [H Let t G N. For a G such that 
b G ^Gr, we let 

« = min|t,min{s G [1, t - 1] : W, = Gr,(a|(,,t]) G 7^,} |. 

In other words, — n is the last time the workload vector is inside the capacity region TZg before 
time 0. With this definition of u, for all m > 1 we have W^, ^ TZg for all f G — 1]. By 
definition of It, we already know that the workload vector starts initially inside TZg at time —t. 
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Therefore, we can find another path a G Mf"* with a reduced cost while keeping the workloads at 
time —u+1 to (i.e., W„_i to Wq) intact by setting a^, = fj,,\fv E (u, t] and = a^, otherwise. It 
is easy to see that we have Pt{si\io,t]) > /''«(a|(o,„]) = /«t(a|(o,i]) and yet b G ^1(0.-10^ = ^G^- 
The same logic also applies to the case when u = 1. However, in this case the only way that we 
can achieve a workload vector b at is if exactly that amount of work arrives, i.e., if ai = b. 
Since by definition W^, = G"f.^(a|(^^u]) for v G [l,u— 1] (when u > 1), we have 



/,(b) = inf^,,^^ /»*(x|(o,i) = min l\{h), min inf_,.,^ /« 



X 



X/^ WC 



mm 



/^(b), min inf J''„(x|(o.n]) ) 
^ «e(i,t] xgA(u,b) <y ■ I' J 



where A(n, b) is defined as in (fT3]) . ■ 

Remark 7: The above lemma reduces the set of feasible sample paths to the set A(n, b) 
for u G (0,t]. It is interesting to note the property of the sample-paths in this set. For any 
X G A(u,h), we have Wo(x) = x(0,m] — (m — 1) = b, recalling that the ' notation is the 
normalized sum of the elements of the vectors. There is no wastage of service capacity over the 
u — 1 timeslots because Vt' G [l,u — 1], W,„ = G'"^:^(x|(^ „]) ^ TZg and hence W^, > 1. That is, 
any sample path x G A(n, b) has its normalized sum of the arrivals over time (0, u] and queues 
equal to x(0, u] = b + (n — 1). 

In addition, an immediate implication of Lemma |9] is that we can rewrite J in (flOl) as 

J(b) = inf Uh) = inf ( Ji(b), inf Jt(b)) = inf ( J«i(b),inf min inf lK( 

t>l V t>2 ' V V t>2u€il,t] xGA(«,b) 

= inf f/«i(b),inf inf lK{^ko,ti] , (14) 

\ t>2xGA(t,b) ' Jy 

where we also used the fact that /i(b) = /''i(b). If we denote t* as the optimizer of the last 
equation, then t* is called the critical timescale (see [24]). It can then be interpreted that t* is 
the most likely length of time it would take to "fill" the buffers to a given level b from being 
"empty" (more precisely, anywhere within 7^^). 

Remark 8: The induction-based proof of Lemma [21 reveals another important property of the 
sample-paths, namely, that at every stage it suffices to consider the quasi-continuous selection 
H'"^(-) of the scheduling function. This then implies that we need to consider all valid arrivals 
sequences that respect the constraints of the set A(-, b) such that using any of the allowed (based 



l(o,«] 
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on the workload vector at each time) scheduling actions given by H^'^{-) results in the workload 
vector b at the required epoch. 

Note that for fixed n G N, infxgA(«,b) lK{'^\{o,u]) is a optimization problem, with a convex cost 
function /^ii(-) and a set A(u, b) of allowable solutions, in general, a K(u — 1) -dimensional set. 
This problem is difficult to solve analytically. Since the cost function Pu{^\{o,u]) = J2^=i SfcLi ^i '^ 
is additive, a possible numerical method is the numerical backwards induction of dynamic 
programming. However, the method suffers from the curse of dimensionality and hence is not 
practical for large u and b. Hence, we turn our attention to finding some simplified bounds 
of the rate functions. This can be done by employing the additivity and convexity of the rate 
function Pt. Next we present some bounds for the case when K = 2. 

C. Properties of the Minimum-Cost Sample Paths 

Here, we see that the convexity of the cost function A^ '' for all k E K., induces two properties 
for the optimal paths. 

Property 1: Constant-speed linear path is the cheapest. Among all arrival sample paths x G 
with the only constraint that x(0, t] = b, i.e., the total amount of arrivals at the end of time 
t eN equals b, the cheapest or minimum-cost path is the constant- speed linear path, where the 
arrival in each timeslot is equal to h/t. 

Proof: This is because the path cost function is additive, i.e., A*(x) = J2k=i Yll=i 
and the per-timeslot cost function A*''^ is convex. Applying Jensen's inequality [54] gives 



A;(x) = i:E 

k 



1 1=1 k=l \ i=l J k=l 



with equality when = b'^/t for all i and k. See an illustration in Figure [T(a) 



From now onwards, without loss of generality, we assume that the arrivals, workloads and 
service vectors are scaled by 1/C where C = (C^, . . . , C^^) is the vector of maximum service 
rates. In this normalized setting, the maximum service rate of all the users is 1. Assume also 
that A^ '' for A; G /C is suitably modified for this scaling, and, for ease of exposition, that with 
this scaling the processes are statistically identical, i.e., A^''^ = AJ for all k E JC. We can then 
write down the following property. 

Property 2: Constant-speed linear path closest to the equal line is the cheapest. For constant- 
speed linear paths a G M^* with the sum a(0, t] = d, the cost of the path is a Schur-convex 
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function [55]. 

Proof: Since the arrival paths are constant-speed linear path, without loss of generality we 
can consider arrival paths in a single timeslot. Consider path x G M^, then the cost of the path 
is ^^=i^i{x^) where A^(-) is a convex function. Therefore from the results in [55] it follows 
that X]fc=i^i(^'^) ^ Schur-convex function. In order words, if x is majorized by y G 
(denoted by x ^ y), i.e., ^^^^ < YJk=i 2/'" for j = 1, . . . , - 1 and Y.k=i = Ef=i 
where is the z*^ largest component of x, then < Yl!k=i ^liv'^)- 

This is easily appreciated when K = 2. Let x = {x,d — x) G and y = {y,d — y) G M^, 
where y > x > d/2, then we have Al{x) + Al{d - x) < Al{y) + Al{d - y), and x is cheaper 
than y. We illustrate this in Figure |l(b)[ ■ 

These properties are also used in [9], [12], [13] for large-deviations analysis of scheduling 
disciplines. Next, we use these properties to calculate I2 and bounds on It for t G N for just the 
work-conserving scheduler operating on the simplex rate-region. 



D. Example: Calculation of I2 

Here we look at an example for calculation of the finite-horizon rate function It to illustrate 

that the calculation continues to be rather involved. For simplicity, consider the case when t = 2 

and K = 2. From ([HI), /2(b) for b G IR+ can be written as 

2 22 
/2(b)=min{^AK6^) mf ,X d^) 

where 

A(2, b) = |a|(o,2] e Mt : a2 ^ 7^„ b G -l("^=^iGr } • 

The workload at time zero is Wq = G^'^(a|(o,2]) = ai + [a2 — //™'^(a2)]^, which is equal to 
a(0, 2] — i/™'^(a2) since a2 ^ TZg. On the other hand, we require Wq = b. Hence, using the 
scheduler H""^ given in ([3]), we can express A(2,b) as A(2,b) = A(i) U A(2) U A(3), where 
A(j) C M;^, j = 1, 2, 3, are defined as 

A(i) := {(ai, aa) G : 4 > a^, > 1, a(0, 2] = b + (1, 0)}, 
A(2) := {(ai, a2) e : 4 < 4,4 > 1, a(0, 2] = b + (0, 1)}, 
A(3) := {(ai,a2) G M+ : 4 < 1,0^ < 1,4 + 4 > l,a(0,2] = b + Proj^^(a2)}. 
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Time i 

(a) Property 1 



9| I .. — ^ 
Path 1 
^" ^K— Path 2 

7- 



6- X 

Equal Line 




Queue 1 

(b) Property 2 

Fig. 1. Two properties of the minimum-cost sample patiis: (a) Property 1: tiie minimum-cost path is the constant-speed linear 
path, (b) Property 2: Path 1 which is closer to the Equal Line has a lower cost than Path 2. 
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Note that in the definition of A(i) and A(2) we have used the property highlighted in Remark [8l 
For the two user case the scheduling function H^'^(-) is only discontinuous at x G such 
that = > 1. Here one can either choose the service vector (1,0) or (0, 1) to obtain a 
quasi-continuous selection. Thus, both of these options have to be considered. 

Using these newly defined sets the second term in the RHS of (fTSi) can be rewritten as 

2 2 2 2 

inf yyA*(xf)=min inf VVA*(xf). 

t=l k=l 1=1 k=l 

Trajectories of some examples of the (accumulated) arrival sample paths are illustrated in 
Figure [2(a)] and their corresponding workload trajectories in Figure [2(b)j In particular, Figure [2(a)] 



shows example trajectories of the accumulated arrival sample paths a|(o,2] G = 1,2,3, in 

the calculation of /2(b), where b = (4,2). Also, for particular values of a|(o,2] £ ^{i), namely, 
a2 = (2.5, 1) and a(0, 2] = (5,2) Figure [2(b)] shows the workload paths W*^^). The same figure 
also displays examples corresponding to arrival paths in A(j), j = 2, 3. For the specific example 
from A(i) the figure shows that wj^^ = a2 = (2.5, 1) and wj^^ = a(0, 2] - (1, 0) = (4, 2). 

This example underlines the difficulty in finding the rate function even for small timescales. We 
expect that the number of constrained sets like A(j) will grow exponentially with time duration 
t. However, the example gives us some insight on how to find some simple upper and lower 
bounds to It for any t eN. 

E. Bounds on It 

In this subsection, we find simple expressions that give lower or upper bounds to infxgA(M,b) lK{^), 
which in turn give the bounds to It and J. We focus on K = 2 but similar results can be obtained 
for general K. 

Lemma 10: For K = 2, h E M^, It{h) can be bounded as 

K 



k=l 

and when b ^ [0, 1)^, 

K 



Itih)< mmuyAl(-ib'' + iu-l)H{h)')], (17) 

where the convex set X(n, b) C is defined as 

X(u,b) := {b + V : + = {u - l),v\v^ > 0}. (18) 
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(b) Trajectories of Workload Paths 
Fig. 2. Example of accumulated arrival and workload paths for calculation of /2(b). 
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Proof: See Appendix |Dl ■ 
Next we look at the tightness of the above bounds for an example of compound Poisson source 
process with exponential packet size. We expect the tightness to depend on the traffic load. 

F. Numerical Examples 

Here we compare the finite-horizon rate functions I2 for three schedulers, namely, max- weight, 
GPS with equal weights, and a priority scheduler that gives higher priority to queue 1. We also 
examine the tightness of the bounds given in Lemma [lOl Both of these are for an average of 
i.i.d. compound Poisson source processes with exponential packet size where the packet arrivals 
follow Poisson distribution of rate A and the average packet size is (see [40]). The function 
A* for this process is given by 

K\x) = - 

for X E IR+. Note that this has domain ]R_|_ and therefore satisfies Proposition [TJ We once 
again make the simplifying assumption that the processes for the different users are statistically 
identical and that the rate-region is the unit simplex. 

First we present results for the comparison of the different scheduling policies. Here we set 
A = 0.1 or 0.3 with the average packet size of = 100. Fig. |3(a)| and |3(b)| show the finite- 



horizon (two-timestep) rate function /2(b) for A = 0.1 and 0.3 respectively. However, these 
calculations are best appreciated when we compare them to the rate functions of other well- 



known scheduling policies. Fig. |4(a)| compares the max-weight scheduler with a GPS scheduler 
with equal weights both at A = 0.3. One can see that the rate function for the max-W scheduler 
is greater than the rate-function of the GPS scheduler for some range of b, i.e., where is 
much greater that b"^ and vice versa. This means that for this range of b and in the many-sources 
asymptotic sense, the work-conserving MW scheduler performs better than the work-conserving 
GPS scheduler (when we only consider the two time-step workload). The reason is that with 
b^ much greater than 6^, MW can serve queue 1 at full capacity (1), where GPS has to serve 
both queues equally at 1/2. Hence, the workload under MW is less likely to reach (6^,&^) at 
the end of the two time- slots when the system starts from being empty. Similarly |4(b)| compares 
the rate-functions for the max-weight scheduler and a priority scheduler where user 1 has higher 
priority. Since the max-weight policy does not discriminate between the two users, we find that 
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it is less likely for h"^ to large for the max-weight in comparison to the priority policy with the 
reverse being true for 6^ 

Next we compare our bounds for the rate function of the max-weight policy with the exact 



rate function in the scenarios where we can calculate it by by brute force. Figure |5(a)| shows the 
upper and lower bounds and the actual values of It, for t = 10, at = 0.01, and various values 
of A = 0.1,0.2,0.3 and when b = (6^,6^ = 1) for various values of h^. Figure [5(b)] shows the 
corresponding minimizing t* for the bounds and the actual expression of It- We note that for 
all b in this example, J(b) is actually equal to /t(b) for t = 10 since all optimizing t* is less 
than 10 (see (fT4)) ). This example shows that, in the range of b in consideration, both bounds are 
tight and almost coincide when the traffic load is small, i.e., A = 0.1. However, when the traffic 
load is higher, the lowerbound becomes loose while the upperbound is still considerably tight. 

It is interesting to note the optimal timescale t* which the queues most likely to take to reach 
the level b. Figure |5(b)| shows that, for example, it is most likely to take only two timeslots for 
CPE process with A = 0.2 to reach the buffer level b = (3, 1), while the most likely timescale 



is four timeslots when the traffic load is higher (A = 0.3). Figure [6(c)] and Figure [6(d)] show the 



optimal trajectories of the accumulated arrival process and the workload process for A = 0.2 
and 0.3, respectively. 

Note that despite a potential exponential growth in computation of /j, such computations 



commonly reduce to simpler cases. For instance, consider the calculation of Jio in Figs. [6(c) 



and [6(d)[ in which we sequentially calculate Ji, I2, h- However, this sequential computation stops 



as soon as one reaches the optimal timescale t* < t. For the values of the vector b in Figs. [6(c) 
and [6(d)[ for instance, t* was at most 4, making the calculations of Iq-Iiq unnecessary. 

Vin. Concluding Remarks 

In this paper, we have established a many-sources LDP for the stationary (infinite-horizon) 
workload for multi-queue single-server system with simplex rate region and under maximum- 
weight scheduling, when the arrival processes assumed to satisfy certain many-sources sample 
path LDP. To extend the LDP of the arrival processes to the LDP of the workloads, we employed 
Garcia's extended contraction principle, which applies to quasi-continuous mappings. Along the 
way, we also establish an LDP for the finite-horizon workload in a very general setting of 
arbitrary compact, convex, and coordinate convex rate region under max-weight scheduling. We 
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for MW 




(a) A = 0.1 
1^ for MW 



0.15 




(b) A = 0.3 

Fig. 3. Finite-horizon rate function /2(b) for tlie max-weigiit scheduler. 
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l^forMW- for GPS 




(a) Max-weight versus GPS 
for MW - for P 




(b) Max-weight versus Priority for user 1 

Fig. 4. Comparison of the rate-functions of the max-weight scheduler, the GPS scheduler with equal weights and the priority 
scheduler where user 1 has higher priority. 
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(b) Optimal Timescales 

Fig. 5. Example of the rate function /i()(b) and its upper and lower bounds, and their corresponding optimizing t* and optimal 
trajectories, when b = (6^,6'^ = 1). 
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(c) Optimal Trajectory for b = (3, 1) and A = 0.2 
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(d) Optimal Trajectory for b = (3, 1) and A — 0.3 
Fig. 6. The optimal accumulated arrival and workload trajectories when b — (3, 1) at A = 0.2 and A — 0.3, respectively. 
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gave the associated rate functions and the expression of the infinite-horizon rate function in term 
of the finite-horizon ones, when the arrival processes have i.i.d. increments. 

Next, we catalogue some interesting areas of future research. The extension of our LDP result 
for the infinite-horizon workload in the case of an arbitrary compact, convex, and coordinate 
convex rate region remains open by and large. The main difficulty in establishing an LDP for 
the infinite-horizon workload is in showing the quasi-continuity of the infinite-horizon workload 
mapping. In the case of simplex rate region, the infinite-horizon workload mapping was shown to 
be reducible to a finite-horizon mapping whose quasi-continuity was established via induction. 
Another question that has only been partially explored in the current paper is the nature of 
Assumptions [IB In particular, beyond Proposition 1, the relationship between stochastic mixing 
properties of the arrival process and the analytical properties of Pt and /" remains an important 
area of future research. 



Appendix A 
Proof of Lemma [H 

Lemma \T\ states that it is possible to construct quasi-continuous functions H and H^'^ such 
that 

H(Wt) e argmax < R, >, 

and 

I H(Wt) otherwise 
We do this using selection theorems (see [58] and [59]) for correspondences associated with 
max-weight scheduling: 

n{Wt) := argmax < R, > . (19) 

First we define the following analytical properties of correspondences. 

Definition 5 (Local-boundedness): A correspondence F : X ^ y is locally-bounded (lb) 
[56, Defn. 5.15, pp. 157-158] at point x E X if there exists a neighbourhood V of x such that 
F{V) := UaevFio-) is bounded in y. Correspondence F is deemed lb if it is lb for every point 

X e X. 
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Definition 6 (Outer semicontinuity): For a correspondence F : X define the outer-limit 
at x e A* to be 

limsupF(a) : = U^.n^^limsup F(a;") = {|/|3x" x, ^y"" y with y"" G 

a— >a; n— ►oo 

Then F is outer semicontinuous (osc) [56, Defn. 5.4, pg. 152] at x G A:" if limsup^^^ F(a) C 
F(x). Correspondence F is deemed osc if it is osc for every point x E X. 

Definition 7 (Upper semicontinuity): A correspondence F : X ^ y is upper semicontinuous 
(also known as upper hemicontinuous, [56, Thm. 5.19, pg. 160] and [57]) at point x E X if for 
any open neighbourhood V of there exists a neighbourhood U of x such that F{a) C V 

for all a E U. Alternatively, F is said to be upper semicontinuous at x G A" if whenever we have 
sequences {xm} E X and {ym} £ y such that G F{xm) for all m, Xm x and y, 
then ?/ G F{x). Correspondence F is deemed use if it is m5c for every point x E X. 

Now we can prove Lemma [TJ 

Proof of Lemma U} The scheduling function 7i(-) : i— P(7^) is a maximal monotone 
correspondence [56, Thm. 12.17, pp. 542-543] that picks closed and convex subsets of TZ for 
every x G M^; thus, compact subsets of M.^. It is, therefore, both lb and use from [56, Ex. 
12.8b, pg. 536]. Now it follows that we get a quasi-continuous selection by [58, Thm. 2] and 
[59, Thms. 2.4 & 2.5] since H(Wt{ )) is use and has compact values. 

To prove that H'"'^ is a quasi-continuous selection we first use the same steps as above but with 
a restricted domain, i.e., for n{-) : Mlfce^cP' ^k) ^ ViTZ) since K^\nfcgic[0' ^k) is a Baire 
space [42] so that results from [58], [59] still apply. From the definition of the work-conserving 
max- weight scheduler, if x G Ilfceict'-'' ^fe)' ^^^^ S^*- continuity from within this set from the 
properties of Proj7^(-). Thus, we can satisfy the definition of quasi-continuity from Defn. HJ ■ 

We refer the reader to [60, Thm. 2.2] and [61, Thm. 3.4] for an exposition and for general- 
izations of the result from [58], [59]. 

Appendix B 
Proof of Lemmas [2H3] 

Next we prove Lemmas [2] which uses the following fact: 

Fact 2: Assume X, y are metric spaces and Fi and F2 are functions from X onto y. If Fi 
is quasi-continuous at x E X and F2 is continuous at x, then Fi + F2 is quasi-continuous at x. 
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Below we state and prove Lemma [2] for a simplex rate region TZg to avoid unnecessary 
notational complexities; the result readily extends to a general rate region as discussed in 
Remark 13 

Lemma |2l- For t E N, GJ^ for IZs is quasi-continuous on R;^* with respect to the scaled 
uniform norm topology. 

Proof: Using our queueing equation we first observe the following recursive relation between 
and Gjl^ for any t G {2, 3, . . .} and x e V^: 

Gr(x|(o,t]) = xi + [Gri(x|(i,]) - //-(G-,(x|(i,]))]+, (20) 

where we used the fact that W^=(x|(o,t]) = Gr(x|(o,t]), and W7^(x|(i,t]) = G7_=i(x|(i,t]) when 
the initial backlog at time —t is 0. 

Equation says that G^'^(x|(o,t]) depends linearly on Xi. This implies the following simple 
but consequential observations: 

Observation 1: If GJ'^ is quasi-continuous at x|(o,f], then it is quasi-continuous at x|(o,j] := 
(xi,X2, . . . ,Xt) for any xi G M^, and if GJ'^ is continuous at x|(o,t], then it is also continuous 
at x|(o,t]. 

Observation 2: If G'7'^'^^(x"|(o,t]) — > G'7'^(x|(o,j]) for a sequence {x"|(o,t]} such that x"'|(o,t] — > 
x|(o,t], then for any sequence {x"|(o,t] = (x^, Xj, . . . , x")} where x" xi, we also have 
G7=(x"|(o,i]) G7'=(x|(o,j]). 

Using the recursive relation in (l20l) . we prove this lemma by induction on t G N. For every 
t G N we assume that the system is empty at —t. Therefore, the arrival sample-path prior to —t 
has no influence on Wq^. Thus, we will only specify the values of the constructed sequences up 
until time t; the extension to sample-paths in "D^ while ensuring that the system is empty at —t 
is trivial. For t = 1, Gj^{ai) = ai, hence G"'^ is continuous on W^. Assuming that GJ^'^ is quasi- 
continuous on M^*, we want to show that GJ!^^ is quasi-continuous on M;^*^*^^^ Using the fact 
that the function is continuous. Remark |2l and Fact[2l it suffices to show that the function 
Ft := GJ^ — H^'^ o G"'^, is quasi-continuous on M;^* to show that Gj^i is quasi-continuous. In 
particular, for any arrival sample path a G Vj^, we need to show that Ft is quasi-continuous at 
a|(o,f]. It suffices to show that it is possible to select a sequence a" ^ a (in V^) for which 

Gr(a"|(o,t])^Gr(a|(o,t]), (21) 

^wc ^ G'r(a"|(o,t]) ^ H^" o Gr(a|(o,i]), (22) 
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such that both and H'"^ o GJ'^{-) are continuous at every a"|(o,t]- Note that in contrast to 

Fact [2l we are adding two quasi-continuous and showing that the sum is quasi-continuous; the 
key to our proof is to ensure that we use the same sequence for both functions. 

We show this by first noting that the induction hypothesis, i.e., quasi-continuity of GJ^, and the 
definition of quasi-continuity ensure that there exists a sequence {a"} such that a" a, in the 
scaled uniform norm topology, such that G'"'^(a"|(o,t]) G'"'^(a|(o,t]), and Gj'^{-) is continuous 
at a"|(o,t] for all n. We will construct the desired sequence {a"} by modifying a" appropriately. 
We proceed by considering the following two cases, depending on the value of ai. 

Case 1: ai > 0, i.e., every component of the ai G is positive. Let e > be the smallest 
component of ai, i.e., e = minfce^cc^i- Since H{-) is quasi-continuous, it is possible to choose a 
sequence of (workload) vectors {w"} such that w" G'7'^(a|(o,t]), and H is continuous at w" 
for all n. Now, we define 

a- := w" - [Fi_i(a"|(i,<])]+ = - G'r(a"|(o,t]) + a" 

(23) 

= (w" - GT{^\m)) + {G7\^\m) - GTi^^^A)) + (^i - ai) + ai. 
It is clear from the last relationship that a^ ai. We still need to ensure that a" > since 
negative quantities are involved in the definition. We do this by using the facts that every 
component of ai G is greater than or equal to e > 0, and that w''^ G'7'^(a|(o,t]), 
Gj^'^(a"|(o,t]) G't(a|(o,t]), and a" ai. These facts imply that there exists an such that 
for all n > we have ||w" - G7=(a|(o,t])|| < e/3, | |Gr'^^(a|(o,j]) - Gr(a"|(o,t])| I < e/3 and 
||a" — ai|| < e/3 (with the square norm) which then together with (l23l) imply that, for the 
sequence a^*"*""', we always have non-negativity of all components. Hence, we construct a new 
sequence {a^ot]} where a^ = a"+'"' and aj\ = a|\+"\ 

This new sequence a^Q is the sequence we are after because using the induction hypothesis 
together with Observations [H and [H we have that G'^'=(a"|(o,f]) -> G'^''(a|(o,4]), and G^"" is 
continuous at a"|(o,j] for all n. Furthermore, by construction 

GT{^\oa) = a? + [Fi„i(a"|(i,i])]+ = al'+"^ + [Fi„i(a-+"^ |(i,,])]+ = w"+-^ (24) 

Hence, we have shown that there exists a sequence a^^^^ satisfying (|2T)) and (|22)) . In addition, 
the continuity of H'"'^ o GJ'^ at a"|(o,t] for all n is a direct consequence of continuity of GJ^ at 
a"|(o,t] and continuity of H^^ at w"+"% which is equal to G7'^(a"|(o,t]), for all n. 
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Case 2: ai > 0. Let /Ci := {/c G /C : a\ = 0} and let /C2 := a.TgmaXkeK.^oi^\{o,t])Ck- 
Without loss of generality, by permuting the user labels we can assume that the first K : = 
|/Ci U /C2I components of ai are either in JCi or in /C2; thus, the rest of the K — K components 
are both positive and not part of the scheduling decision made at time with arrival sequence 
a(o,t] . Now consider the sequence a"' := [1 /mC]^. + ai where [1/mC] ^ is short-hand for a vector 
with l/{mCk) in the first K components and in the remaining coefficients; by construction a^ 
converges to ai such that for every m every component of a™ is positive. We construct a sequence 
{a'"|(o,t]} with this a™ and ar\{i,t] = a|(i,t]. It is obvious that G7"(a™|(o,t]) ^ G7^(a|(o,t]) since 

Gr(a'"|(o,]) = ar + [F,_i(a-|(i,])]+ = a^ + [F,_i(a|(i,])]+ = [1/mC]^ + Cn^lm)- 

When G'7'^'^°(a|(o.t]) ^ [0,C)^, by construction, we have 

^wc ^ cn^nm) = (Gr(a'"|(o,t])) = (Gr(a|(o,i])) = H o Gr(a|(o,i]), 

where the function H{-) is the regular max-weight scheduling function (with TZg). On the other 
hand, if G7^(a|(o,t]) G [0,C)^^, then the continuity of Proj^^(-) yields i/"^ o GT(a™|(o,t]) 
i/-oGr(a|(o,t]). 

Since for each m we have that a^ has all elements strictly positive, we can use the construction 
from Case 1 but with a'"|(o,t] in place of a|(o,t]. In particular, for each m, we can now generate 
a sequence {a'"'"} such that a|™'" — > a^ as n — > +00, a™'"|(i j] = a"|(it], and by using 
Observations \T\ and |2l the following hold 

Gr(a"''1(o,t])^Gr(a™|(o,i]), (25) 
^wc ^ ^wc^~™,n|^^^^^) ^ ^wc ^ G'r(a'"|(o,t]), (26) 

with both GY^{-) and i/™^ o G^^-) being continuous at a'"'"|(o,j] for all m, n. 

Now we define the sequence a'" = a™'"* as the sequence we are after. By construction, 
we have ar\(o,t] ^ a|(o,t] and both GT(-) and H""^ o GT(-) continuous at all a"|(o,t]. Since 
Gr(a"|(o,t]) ^ G7^(a|(o,t]) and if"^ o G'r(a'"|(o,t]) ^ H^^' o G'r(a|(o,t]), it follows from m 
and (1261) that Gr(a'"|(o,i]) ^ Gr(a|(o,t]) and o Gr(a™|(o,t]) ^ i/"^ o Gr(a|(o,t]). ■ 

Remark 9: The proof above can be carried out for every rate-region in the class that we are 
interested in. The argument presented in Case 1 would remain exactly the same but the argument 
presented in Case 2 would have to be modified to account for a further characterization of 



February 26, 2009 



DRAFT 



36 



argmaxRgTe < Wo(a|(o,t]), R >, especially when it is not a singleton. The components of ai will 
need to be adjusted in such a manner so as to not perturb argmaxRgT^ < Wo(a'"|(o,t]), R > for 
the adjusted sequence a"'|(o.t]- As the case of a non-singleton argmaxRgT^ < Wo(a'^|(o,j]), R > 
will correspond to a specific set of values of Wo(a|(o,j]) (a cone) such that the boundary of 
the rate-region and a hyper-plane intersect at more than one point, we will need to use the 
normal corresponding to the hyper-plane in constructing the appropriate perturbation. Thus, on 
a case-by-case basis the same argument can be carried out for every rate-region. 
Finally, we prove Lemma [H 

Lemma\3\- For t G N, Gt{-) is quasi-continuous on M;^^*. 
Proof: Consider the queueing equation, i.e., 

Wt-1 = [Wt - Rr]+ + ar T G N, 

where R(.) G 7Y(W(.)) where 7i(-) is the maximal monotone correspondence defined in (fT9l) . 
Assume that we start the system at (fixed) time t E N with workload vector G M;^; we 
will often assume that = 0. First, for t G N we define a correspondence ^((W^, a|(o,t]) : 
]]j(t+i)A _^ -^K j-jj^j. j-gpresents all possible workload vectors at time that can be achieved from 
the inputs (Wf, a|(o,t]). This results from the successive application of the queueing equation 
where we use all possible values in 7Y(-) based upon the workload vector that results at each 
step. Our goal is show that Qt{-) admits a quasicontinuous selection which we shall call Gt{-)- 
For t > 1 it suffices to establish that 

^i_i(Wi,a|(i,,]) -7^(^i_i(Wi,a|(i,i])) : = 

{x G R-^ : X = y - z for some y G ^t-i(Wt, a|(i_t])and z G 7i(^t-i(Wt, a|(i,t]))} 

admits a quasicontinuous selection which we shall call (with an abuse of notation) Gt~i(^t, t]) — 
i/(G'^_l(W^, a|(i t])): 1) since [-j^ is a continuous function, we have 

[G'(_i(W(, a|(i t]) — H{Gt-i(Wt, a|(i f]))]+ being a quasi-continuous function; and finally, 2) by 
properties of projections and by the definition of quasi-continuity we get the quasi-continuity of 
Gt(Wt, a|(o,t])- We should add a note of caution here that even though for notational convenience 
we write 

G't(Wt,a|(o,i]) = [G't_i(Wi,a|(i,i])-i7(G't„i(Wi,a|(i,t]))]+ + ai, 
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it need not be the case that the quasi-continuous selection that we obtain for Gt be related 
using the above queueing equation to the quasi-continuous selection for Gt-i- Additionally, we 
may not even use the quasi-continuous selection H{-). Therefore, the property highlighted in 
Remark [8] need not hold. 

The proof will once again use mathematical induction where our induction step will assume 
that Qt~i{-) is osc. Since Wq < Wt + a(0,t] for any Wq G ^t(Wt, a|(o,t]), it follows that 
^t(Wt, a|(o,t]) is lb. Now using [56, Ex. 12.8b, pg. 536] we know that 'H{-) is both lb and osc, 
and therefore by [56, Thm. 5.19, pg. 160] it is also use. Then using [56, Prop. 5.52b, pp. 184- 
185] we have li.{Qt~i{-)) being osc. Therefore it also follows that Qt~i{-) — 'H{Qt~ii )) is also 
osc. Once this has been demonstrated the induction step is very easy as Qti') is obtained from 
^t-i(')) ~ ^(^t-i(')) by continuous transformations, as mentioned above. This same method 
also allows us to establish the initial step of the induction procedure; note that we will be dealing 
with n(Wt) and - 7^(W^) in this case. 

Since gt^^{-)-H{gt-i{-)) is lb, using [56, Ex. 12.8b, pg. 536] we have Gt-ii') - HiGt^ii-)) 
also being use. Finally, we get a quasi-continuous selection by [60, Thm. 2.2] and [61, Thm. 
3.4] since Qt-i{ ) — 'H{Gt-i{ )) is use and takes compact values. 

The required result then follows by setting G'((a|(o,j]) = GtiO, ^\{o.t])- ■ 

Appendix C 
Proof of Lemmas [5}l7] 

First we prove Lemma [5j 

Lemma \5\ Consider an arrival process a G Vj^. There exists a s* = s*{a) < oo such that 
the workloads at time —s* under a falls within the rate region TZ^, i.e., G"'^(a|(s. oo]) ^ T^s- 
Furthermore, for any sequence of arrival processes {a" G I^^} converging to a (in scaled 
uniform topology), the workloads at time —s* under a", when n is large enough, also fall within 
the rate region TZs, i.e., 3no such that G"'^(a"|(s.,oo]) ^ T^s for n > uq. 

Proof: Consider the normalized sum arrivals and the normalized sum workloads, and follow 
the proof in [24], [26] for the (aggregate) single-queue scenario. Given the definition of H^^ 
and the simplex capacity region TZg, the queue dynamics for the normalized sum workload is 
that of a single queue whose arrivals are the normalized sum of the arrivals, i.e., 

W,_i = [Wi-l]+ + ai, (27) 
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where recall that the hat (") notation means the normalized sum over all users, i.e. = 
Ylk=i'^^ 1^^ ^'^'^ ~ J2k=i^t' /^^- Recursion of the queue dynamics (ITTI) and letting 
T ^ oo where £ T^s, gives the standard expression for the infinite-horizon sum workload 
[26]: 

Wo := G{a) = sup a(0, t] - {t - 1), (28) 

teN 

where G represents the infinite-horizon normalized sum workload mapping; in other words, for 
all s, Ws = G{a\(s,oo]) represent the normalized sum workload at time s, under arrival sequence 
a. 

To prove the lemma we use the fact that the rate region TZg is simplex, hence < 1 -v^ 
Ws G TZs- Thus, it suffices to show that there is a finite and a finite s such that G'(a|(s,oo]) < 1, 
and for n > n'^, G'(a'"|(s,oo]) < 1- 

Since a G Vj^, there is a to < oo such that for all e > 0, t > to and k E K, ^L-^ < 
jjJ' + eG'^ . Since /x G int(7^s), we choose e = (1 — jji)/AK. We now have that for all t > to, 
^^j^ < fi + eK = (1 + 3/i)/4 < 1. In other words, the workload at time zero is a function of 
only the arrivals within time (0,to] and hence, 

Wo(a) = sup a(0,t] - (t - 1). (29) 

l<t<to 

Let s* < to < oo be the minimum values of the optimizing t's in the above equation. It can 
be shown [26, Lemma 5.4] that 

G'(a|(^.,oo]) < 1- 

It is known that G is continuous on Dp, [24, Lemma 13] when (1 < 1. However, this together 
with continuity of shift mapping implies that for all {a"} such that a" converges to a in scaled 
uniform topology, 

G'(a"|(„,oo]) ^ G(a|(„,oo]) for all u G [0, s*]. (30) 

In particular, (|30l) implies that there exists uq such that for all n > uq, the normalized sum 
workload under arrival sequence a" at time u = s* is no more than 1 packets, i.e., 

G{si^\{s*,oo]) < 1 for all n > uq. 

However, since the rate region is a simplex, the workload vectors at time s*, under a and a" lie 
in the rate region TZg. Hence, we have the assertion of the lemma. ■ 
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Next, we prove Lemma [6l 

Lemma ^ Let G mi{lZs), a be an arrival sequence with rate /x with /''(a) < +00, and 
W = G™'^(a) be its corresponding steady state workload. For any W G **G™'^ there exists 
a sequence of arrivals {a" G V^} such that a" converges to a in the scaled uniform norm 
topology, G^^(a") G'"^(a), G"^ is continous at a", and /»(a") ^ l\a). 

Proof: Lemma [5] implies that for any given arrival sequence a G V^, there exists a s* such 

that 

G-(a) = G-(a|(o,.*]). 

However, G^i^ is quasi-continuous on M;^^* . This implies that there exists a sequence of finite 
arrivals {a"|(o,s.]} such that 

1) ^"-1(0,5*] a|(o,s*]; 

2) G^^ is continuous at a"|(o,s*]; and 

3) G7^(a"|(o,s*]) GJ^(a|(o,s*]). 

Now construct the sequence of arrivals {a"} via concatenation of a|(s.,oo] and a"'|(o,s*]- It is 
immediate that a" a. 

Appealing to Lemma [51 for n large enough (greater than uq) we have 

1) convergence of G"'^(a") to G™'^(a) since 

G-(a") = G-^(a"|(o,..]) ^ G-^(a|(o,..]) = G-=(a); 

2) continuity of G'^'^(-) at a". For any sequence converging to a" in by appealing to 
Lemma [5] we know that far enough along every sequence only the arrivals in (0, s*] 
matter. Now using the fact that projection is continuous on V^, we get the result from 
the continuity of G^^ at a"|(o,s*]- 

This establishes the quasi continuity of function G""^. Lastly, Assumptions |3] and |2] ensure that 
jS(a-) ^ Jtt(a). ■ 
Finally, we prove Lemma |71 

Lemma [2' If /x G int(7^s), the mapping G^^{-) is almost compact on with respect to the 
scaled uniform norm topology. 

Proof: This follows almost exactly along the same lines as the proof of Lemma [6l For any 
a" a we proved the existence of a no such that for n > uq such that the workload vectors 
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only depended on the arrivals within time (0, to] - Thus, the proof of almost compactness simply 
follows from Lemma IH Note that we have used the fact that the projection operator is continuous 



on Pff. 



Appendix D 
Proof of Lemma [TO] 

Next we prove Lemma [10] which gives the bounds on If. 
Lemma \Wj; For K = 2, b G Mi, /t(b) can be bounded as 



W>min.f A;(i(Proj,,„,,,(0 



K 

U 

and when b ^ [0, 1)^, 

K /I 

It{h) < min nV A* -{ij' + {u - l)iJ(b)^" 

where we recall that the convex set X(m, b) C is defined as 

X(m, b) :={b + V : V G and t;^ + = (m - 1)}. 

Proof: Let b G M+, time u G (0,t], arrival path a|(o,j] G A(n,b), and Wj G be the 
workload vector at time —i for i G (0,m]. Assume without loss of generality that t > 1 as it is 
easy to see that both bounds turn to be /i(b). Owing to this we can also assume that n > 1 
since the terms corresponding to n = 1 in both bounds evaluate to /i(b). 

We first show the lowerbound (]T6]) . As we have noted earlier that for a|(o,j] G A(m, b), the [■]"'" 
function can be removed from the queue dynamics. Hence, we have a(0, u] = ^ + J21^=i H{Wi), 
where W„ G TZs and ^ TZs for all i G (0, n - 1]. Using this and the fact that H(Wi) G {v G 
: t"^ + t"^ = 1}, for all i G {0,u — 1], we have a(0,M] G X(m, b) where X(m, b) is defined 
above. Now, given any point d G X(m, b), the constant- speed linear path with increments of d/u 
is the minimum-cost path among all the paths with the same destination (using Property 1). In 
addition, among all the paths to destinations in X(M,b), the closest constant- speed linear paths 
a*|(o,u] to the equal line is the minimum-cost path (using Property 2). Since the closest point in 
X(n,b) to the equal line is Projx(„,b)(0)' we have a*|(o,«] = (a* = ^Projx(„,b)(0)' ^ ^ (0,u]). 
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Since the set of paths with destination in X(m, b) includes all paths in A(u,h), from (|7]) we 
have the lowerbound (fT6l) : 

K u 

/t(b) = min inf y^y^Ai(xf)> min inf Jt(x) 

uG{0,t] xGA(«,b)^-^ «e(0,t] xeI8^*:x(0,«]GX(«,b) 

K 

mm 



inf /„(x)= min n ^ (Proj5,(,b)(0))'') 



Me(o,t] xeR^":xeX(?i,b) 

To show the upperbound (flTl) . we only need to show that the constant-speed linear path 
a|(o,u] = (aj = ;^(b + (n — l)if(b)), -i G (0, u]), is in A(n, b), when b ^ [0, 1)^. Without loss of 
generality, we consider only when 6^ > }? and 6^ > 1. In this case, we set i/(b) = (1,0) and 
the queue dynamics gives 

W, = ^^i^(b + - 1)(1, 0)) - - 1 - 0(1, 0), 

for all z G (0, n — 1]. Since 6^ > 1, we have W} > 1 and Wl > Wf, and hence we can once 
again set HiWi) = (1,0) for all i e {0,u— 1]. Hence, a|(o,M] G A{u,h)). ■ 
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